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TRANSFORMING PIXEL DATA AND ADDRESSES 



Background 



This invention relates generally to graphics or video 
engines commonly used in computer systems . 

A graphics or video engine is responsible for 
5 processing or manipulating data received from a pixel 

source such as a processor, a graphics controller or other 
devices. The graphics/video engine takes the pixel data 
and operates on that pixel data in a variety of well known 
ways. For example, the color space of the pixel data may 

10 be converted using routine algorithms. Alternatively, the 
pixel data may be scaled to change the size of the 
displayed information, again using well known algorithms. 
Similarly, the pixel data may be subjected to composition 
effects such as blurring, contrasting or other distortions. 

15 In the conventional process, pixels are generated and 

deposited into memory as a result of a drawing operation 
such as three dimensional or video based rendering. To 
impose additional transformations like color space 
conversion or scaling, these pixels are then typically 

20 fetched from memory by a "fetch" engine. The operation is 
imposed and the pixels are then written back to the same 
memory location. The imposition of the transformation is 
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termed "active" because it requires an explicit fetch 
engine to be set up with the parameters of the operation. 

Thus, when a number of transformations are involved 
with given pixel data, many fetch engines may be needed 
5 that have the redundant functionality of generating pixel 
addresses. The use of these fetch engines complicates the 
memory controller that must operate with all of the fetch 
engines contending for memory bandwidth. 

The need to use the fetch engines may also reduce the 

10 available memory bandwidth because of the need for read- 
modify-write cycles when imposing filtering operations 
after rendering. Moreover, when multiple transformations 
are needed, the programmer must awkwardly impose the 
transformations by causing the fetch engine to manipulate 

15 the data between a memory location and the various 

transformation engines. In addition, there is no easy way 
to cause multiple transformations to occur in a serial 
fashion. 

Thus, there is a need for better ways to impose 
20 transformations on pixel data in graphic/video engines. 

Brief Description of the Drawings 
Figure 1 is a schematic depiction of the prior art; 
Figure 2 is a schematic depiction of one embodiment of 
the present invention; 
25 Figure 3 is a more detailed schematic depiction of one 

embodiment of the present invention; 
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Figure 4 shows the memory architecture of one 
embodiment of the present invention; 

Figure 5 is a flow chart for software useful in 
accordance with one embodiment of the present invention; 
5 Figure 6 is a flow chart useful with another 

embodiment of the present invention; 

Figure 7 shows a memory architecture in accordance 
with an embodiment of the present invention; and 

Figure 8 is a block depiction of a processor-based 
10 system in accordance with one embodiment of the present 
invention . 

Detailed Description 
Referring to Figure 1, in the conventional active 
system of imposing transformations on pixel data, the pixel 

15 data may be operated on by a plurality of transformation 
engines such as the scaling engine 100, the color 
conversion engine 106, and the composition engine 104. To 
do so, a fetch engine (not shown) fetches the data from 
memory and passes it to a first transformation engine, such 

20 as the scaling engine 100, as indicated by the arrow marked 
1. When the transformation is complete, the fetch engine 
returns the data to the memory 102 as indicated by the 
arrow 2. Similarly, the data may be shuttled between the 
color conversion engine 106 and the composition engine 104 

25 by using a fetch engine to fetch the data from memory 102, 
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and then fetch the data back from memory 102, normally to 
the same memory 102 location. 

In accordance with one embodiment of the present 
invention, shown in Figure 2, a serial architecture for 
pixel data transformations may be implemented. For 
example, the data may be fetched from memory 102 into a 
scaling engine 100 as indicated by the arrow 1. When the 
scaling is complete, the addresses for the data may be 
transformed to write the data to a color conversion engine 
106 as indicated by the arrow 2. After color conversion, 
the data addresses may be transformed so that the data is 
written directly into a composition engine 104. 
Thereafter, the composed data may be written back to the 
memory 102 as indicated by the arrow 4. Thus, the 
operations accomplished with embodiments of the present 
invention may be more efficient than those of the prior art 
illustrated in Figure 1. 

The pixel data operations in accordance with one 
embodiment of the present invention may be termed passive 
as compared to the active operations utilized in the prior 
art. An application may write pixels to a range of virtual 
memory addresses and the passive engine may impose the 
chosen operation. A new u re-mapped" memory address is 
generated and the pixel data is written to the new memory 
location. 



Any conventional transformation or operation performed 
on pixel data including but not limited to, scaling, color 
conversion and composition can be implemented using a 
passive engine. Moreover, as indicated in Figure 2, the 
multiple transformations may be tied together and performed 
in a serial fashion by setting the output addresses of one 
function to the same range as the input address of another 
function . 

As shown in Figure 3, a pixel source 12, that may be a 
processor, a graphics controller or another device, sends 
data and address information to a memory controller 14 . 
Some of the information passed from the pixel source 12 to 
the memory controller 14 may map to a virtual memory range 
of a media port target 16. The media port target 16 is a 
memory controller client that accepts pixel data written to 
virtual addresses set aside by the operating system. 

The media port target 16 may provide the data to a 
variety of transfer functions 18 whose purpose is to 
receive pixel data and addresses from the media port target 
16, perform a pixel and address transformation, and forward 
output pixel data and addresses to a media port write back 
engine 20. The media port write back engine 20 is a write- 
only memory controller client that receives pixel and 
address information from the individual transfer functions 
18 and forwards the pixel data and addresses to the memory 
controller 14. 



The operating system cooperates with the media port 
target 16 to set up a range of virtual memory addresses for 
use by the media port target 16. The operating system may 
set aside a specific memory range as virtual addresses that 
map to the media port target 16. Pixel data written to 
this memory range is not forwarded to physical memory. 
Instead, that pixel data is forwarded to the media port 
target 16. Each of the transfer functions 18 is also 
assigned a virtual memory range within the overall virtual 
memory range set aside for the media port target 16. 

Thus, referring to Figure 4, the total addressable 
memory 24 may include a portion 26 of virtual memory set 
aside for the media port target 16. Within the memory 
range 26, dedicated to the media port target 16, are 
transfer function address ranges 28i through 2 8 n/ each 
dedicated one of n transfer functions 18. Each transfer 
function 18 defined in the media port target 16 has a 
defined output memory address range 28. Pixel data written 
to a transfer function memory address range 28 goes through 
a transformation and has its addresses translated or re- 
addressed as indicated at 20. 

As a specific hypothetical example, the operating 
system may set aside the memory address range from 0x000000 
to 0x800000 (8 megabytes) as virtual addresses that map to 
the media port target 16. The range of addresses from 
0x000000 to 0x200000 may be set aside for a first transfer 



6 




function 18 such as color conversion. The range of 
addresses from 0x200000 to 0x800000 may be set aside for a 
second transfer function 18. The color conversion 
function's output memory address range may be defined to be 
5 from 0x800000 through OxAOOOOO. 

Each transfer function 18 performs a specific pixel 
transformation operation and address translation. The 
pixel data and addresses generated by the various transfer 
functions 18 may be written back to the memory controller 
10 14. While a single memory port write back engine 20 is 
illustrated in Figure 3, more than one media port write 
back engine 20 may be utilized to consolidate the pixel 
data and interfaces to the memory controller 14. 



RGB color space to YUV color space. For example, the base 
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As another specific hypothetical example, the transfer 
function 18a may be the color space transformation from the 



address "baseinput" of the input pixel data may be 
0x000000. The base address "baseoutput" of the transfer 
function 18 output pixel data may be 0x800000. Applying 
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the transfer function 18, the RGB to YUV conversion 



operation may be as follows: 



y=alxR+a2xG+a3xB ; 



u=blxR+b2xG+b3xB ; 



v=clxr+c2xG+c3xB . 



25 



The addresses of the pixel data after color space 
conversion are transformed from their original pixel 
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addresses. First, the offset to each pixel is determined 
by subtracting the baseinput from each pixel's original 
address. Then, the offset is added to the baseoutput to 
get the new address . 
5 In this way, each RGB pixel data may be converted 

through the transformation equation indicated above to 
output YUV pixel data. The output YUV pixel data is 
deposited at memory location 0x800000 to OxAOOOOO in an 
example in which the baseoutput is 0x800000 and the 

10 available memory range stops at OxAOOOOO. Similarly, the 
new addresses of the color space transformed pixel data is 
the offset added to the baseoutput to position the pixel 
data in easily determined locations, for the next transfer 
function or for return to memory. 

15 As another hypothetical example, a first transfer 

function, such as a color space conversion, may be 
implemented in the virtual memory address range from 
0x000000 to 0x200000 as indicated in Figure 5. A second 
transfer function, performing a 2:1 horizontal scaling, may 

2 0 be implemented in the virtual memory address range from 
0x200000 to 0x400000. A third transfer function, 
performing a blurring operation, may be implemented in the 
virtual memory address range from 0x400000 to 0x500000. 
The first through third transfer functions may be 

25 performed serially by setting the output memory address 

range of the first transfer function to start at 0x200000, 



and setting the output memory address range of the second 
transfer function to start at 0x400000. Pixel data written 
to the range from 0x000000 to 0x200000 is color converted 
and forwarded to the virtual memory address range from 
5 0x200000 through 0x400000. This pixel data is then scaled 
by horizontal averaging and written to the virtual memory 
address range from 0x400000 through 0x500000. The final 
output data of this sequence may written to the output 
memory address range designated by the third transfer 
10 function. 

?*% Transfer function software 32, shown in Figure 6, may 

^3 be stored in association with the memory controller 14, in 

£0 accordance with one embodiment of present invention. 

21 Initially, the software 32 determines whether the pixel 

CI 15 data has been received as indicated in diamond 34. If so, 
the first transfer function's transformation is performed 
as indicated in block 36. Thereafter, the address 
transformation is performed as indicated in block 38. 

An example of serialized transformations is shown in 
20 Figure 7. The software 40, in accordance with one 

embodiment of the present invention, determines in diamond 
42 when the pixel data has been received. Upon receipt of 
the pixel data, the first transfer function is performed as 
indicated in block 44. Thereafter, the output memory 
25 address range is set to the base address of the second 
transfer function as indicated in block 46. The second 
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transfer function may be performed followed by address 
transformation as indicated in block 48. The serial 
execution of transfer functions followed by address 
transformations may be extended to any number of transfer 
functions . 

Referring to Figure 8, in accordance with one 
embodiment of the present invention, the memory controller 
14 is coupled to system memory 50 and a bridge 56. The 
bridge 56 is in turn coupled to a processor 58 and a 
graphics controller 52. The graphics controller 52 may be 
coupled to a display 54. Thus, the pixel source 12 may be 
the processor 58, the graphics controller 52 or some other 
device. In one embodiment of the present invention, the 
software 32 and 40 may be stored in a storage associated 
with the memory controller 14. As is conventional, the 
bridge 56 is coupled to a bus 24. The bus 24 may include a 
peripheral device 26 as illustrated. 

In some embodiments of the present invention, there 
may be no need to use a plurality of fetch engines having 
redundant pixel address generating functionality. These 
fetch engines may complicate the memory controller that 
arbitrates among all of the engines contending for memory 
bandwidth. Transfer functions may be implemented, in 
accordance with embodiments of the present invention, under 
the same master memory controller although other 
implementations may also be possible. In addition, memory 



bandwidth utilization may be reduced by the elimination of 
the read-modify-write cycle typically used in active 
architectures when imposing filtering operations after 
rendering. In accordance with some embodiments of the 
present invention, pixel data being generated may be 
written directly to one or more successive memory-mapped 
media ports that perform the desired filter operations 
before returning the pixel data to memory. 

In some embodiments of the present invention, multiple 
filter operations may be flexibly allocated and serially 
applied through intelligent memory aliasing. If a 
programmer wants to impose color space transformations, 
followed by scaling, followed by convolution operations, 
the programmer may do so by simply mapping one operation's 
output into the input of the next operation. With 
conventional active systems, filter operations are 
typically implemented in the middle of a fetch and write 
back circuit. If several operations are implemented and 
only one is used, the other operations are effectively 
unusable for the duration of the operation because of the 
use of a single, standalone execution pipe. 

While the present invention has been described with 
respect to a limited number of embodiments, those skilled 
in the art will appreciate numerous modifications and 
variations therefrom. It is intended that the appended 



claims cover all such modifications and variations as fall 
within the true spirit and scope of this present invention. 
What is claimed is: 
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